Ten years of morphodynamic data at a micro-tidal urban beach: Cala Millor (Western Mediterranean Sea)

Systematic and sustained high quality measurements of nearshore waves and beach morphology are crucial to understand morphodynamic processes that determine beach evolution, to unravel the effects of global warming on sandy coasts and thus improve forecasting models. In 2011 a comprehensive beach monitoring program, the first in the Mediterranean Sea, started at Cala Millor Beach on the island of Mallorca (Spain). The aim was to provide long-term datasets of near-shore morphodynamics in a carbonate sandy micro-tidal and semi-embayed beach fronted by a Posidonia oceanica seagrass meadow. We present our morphological and hydrodynamical dataset of Cala Millor covering more than a decade. The dataset includes topobathymetries, shoreline positions obtained from video cameras, meteorological parameters from a weather station, currents, as well as waves and sea level from ADCP measurements and sediment size. This free and unrestricted archived dataset can be used to support the modelling of erosion-deposition patterns, calibrate beach evolution models, and as a result to propose adaptation and mitigation actions under different global change scenarios.

observations has been released via open access. The current dataset covers beach evolution and hydrodynamic forcing over more than ten years, including mild and energetic conditions as well as erosion/accretion periods at different spatio-temporal scales. This has thus resulted in a comprehensive dataset for near-shore morphodynamics research.

Study site. Cala Millor beach (CLM) is located on the north-eastern coast of Mallorca in the Western
Mediterranean Sea (Fig. 1). It is a carbonate sandy beach, ~2 km in length, with a concave shape fronted by a boulevard wall over which hotels and residential houses span over a Holocene dune system and the remnants of a man-filled humid zone 19 . CLM is an intermediate beach with a configuration of transverse and crescentic bars 20 which means that there is a large alongshore shoreline variability and also rip channels 15,21 .
The beach bottom is characterized by the cropping out of rock reefs, and at depths from 6 to 35 m there are paleochannels and the seagrass meadow of the endemic Posidonia oceanica, which acts as a cover to sediment exchange and a friction obstacle to waves 22 . The beach sediments at CLM consist of medium carbonate bioclastic marine sands with a median diameter of approximately 1.8Φ. These sediments correspond to a mid-Holocene attached regressive barrier that prograded landward through a foredune and a field of parabolic dunes. CLM is exposed to mild-moderate wave conditions, with a mean significant wave height H s = 0.52 m and peak period T p = 6.1 s. The wave climate is strongly seasonal dominated, characterized by low and short mostly locally-formed waves due to summer sea breezes, and higher and longer well-developed waves during winter that can reach 4 m in height 18 . Tides are negligible, since the daily tidal range is ~0.2 m. Surge components induced by wind or atmospheric pressure can increase the sea level by up to 1 m 23 . However, there is an inter-annual sea-level variation up to 0.5 m due to inter-annual fluctuations of sea-temperature, internal oscillations in the Mediterranean basin, and the interaction of internal currents in the Western Mediterranean sea 24 . topography surveys. Since 2011, regular and systematic topographic surveys have been conducted at CLM.
Two regular field campaigns are scheduled each year: one during early summer, to capture the morphodynamics related to the winter storm exposure (winter profile), and a second during middle autumn, to capture the morphodynamics related to the general mild wave conditions (summer profile). An RTK-GPS (Real-Time Kinematics) system mounted in a backpack is used to measure sand levels in the surf-zone and swash zone areas. These areas are sufficiently shallow that they are not reached by the echo-sounding system. The system also monitors the emerging beach (dry beach). The shallow waters and dry beach topography covers from approximately 1.4 m depth (to ensure overlapping of the RTK-GPS surveys and the echo-sounding paths) to the upper edge of the dry beach (boulevard wall foot). The RTK-GPS surveys consist of U-paths series covering this range with an average distance between paths of 10-15 m. Figure 2e shows an example of the RTK-GPS U paths.
All points are measured in the European Terrestrial Reference System 1989 (ETRS89), with the 2008 European Gravitational Model adapted to the Spanish Network of High Precision Leveling (EGM08-IGN) as vertical reference datum.
From 2011 to 2018, summer field campaigns covered the full extent of the beach (Fig. 2b), while during the winter field campaigns, only profile measurements were undertaken (Fig. 2c). Since autumn 2018, complete bathymetries have been performed during all field campaigns.
The raw 3D point clouds are post-processed to remove outlier measurements and echoes from the echo-sounding by means of the Hypack©SWEEP software. RTK-GPS measurements are also included during the post-processing to obtain high resolution 3D point clouds of sand-levels along the embayment. Point clouds are computed into 1 × 1 m Digital Elevation Models (DEMs), with the sand level taken as the average elevation within a 1 m radius around any grid point. Profiles and gridded bathymetries are interpolated from the post-processed 3D point clouds.
Profiles are extracted with QGIS software 25 through a Delaunay triangulation of the point clouds at the transects shown in Fig. 2c and stored in comma-separated value (CSV) files (see Table 1). Cross-shore profiles are provided at a fixed 1 m cross-shore spacing, extending seawards from the boulevard wall foot to ~10 m water depth (Fig. 2f). DEMs are interpolated also from the 3D point cloud in regular rectangular 10 × 10 m equispaced grids. Video-Monitoring data and shorelines. Video-monitoring systems are a widely used remote sensing method to track the evolution of beach shorelines, near-shore morphodynamics patterns or beach bathymetry [26][27][28][29] . A 5 camera video-monitoring system is used to measure the shoreline position along CLM (Fig. 1). This system is part of the open-source SIRENA video-monitoring system, developed by IMEDEA (Mediterranean Institute of Advanced Studies, CSIC-UIB) in 2007 30 . The video monitoring system, located at 39°59′N and 3°38′E (WGS84) at ~46 m height, captures images at 7.5 Hz starting with the first 10 minutes of each hour of daylight to provide statistical image products (Time Exposure, Snapshots, Variance and TimeStacks). Image products are open-soruce and freely available at https://apps.socib.es/beamon.
Shorelines are digitized biweekly from the georeferenced planview images (ETRS89). Georeferenced planview images from the composition of oblique images are computed using photogeometric techniques (Fig. 3). These techniques account for the rectification of far-field distortions due to lens characteristics (intrinsic calibration), and they link real-world geodetic coordinates and image pixel coordinates (extrinsic calibration). Intrinsic calibration is performed during camera installation by means of the open-source Caltech Camera Calibration Toolbox for Matlab 31 . Extrinsic calibration is obtained using Ground Control Points (GCPs, pixels whose real-world coordinates are known) and the position and orientation (pitch-roll-yaw parameters) of the cameras 32 .
Waves and water depth. Since 2011, wave characteristics and water depth have been recorded by a Nortek Acoustic Wave and Current (AWAC) profiler deployed at 17 m depth, approximately 1 km offshore from CLM ( Fig. 1). This is a combination of a bottom-mounted upward-facing Doppler current profiler (ADCP) and a directional wave gauge. The ADCP measures directional currents along the water column, while directional wave parameters are computed by logging a time-series of pressure, surface velocity and acoustic surface tracking (AST), which estimates the frequency spectrum and other non-directional wave parameters 33,34 . The wave measurement setup uses 1200 samples at 1 Hz starting at the beginning of each hour. Raw data is processed by Nortek QuickWave© software which provides the main wave parameters (non-directional and directional spectrum), surface currents and water depths. Wave dataset spans from 2011/05/20 to the present day, except for a gap from 2019/05/20 to 2019/10/29 due to an instrument failure. real time weather measurements. Weather conditions are measured by a Vaisala weather transmitter (WXT) 520 providing real-time meteorological variables (e.g. atmospheric pressure, temperature, relative humidity, precipitation and wind speed and direction). The station is located close to the video-monitoring station at 49.4 m height.
Sediment grain size. During summer field campaigns, sediment samples are collected at the same positions covering five profiles along the beach (Fig. 2d). Sediment samples from the dry beach are sampled manually. Between 2011 and 2018, samples were analyzed with GRADISTAT v. 4.0, and between 2018 and 2020 with v. 8.0. Most of the samples were thinner than 2 mm, and were therefore analyzed exclusively using the Malvern granulometer and processed by the GRADISTAT spreadsheet. For coarser sediment samples, a 100 gr sample was sieved, using 2, 1 and 0.5 mm sieves. These samples were analyzed by integrating the sieving and laser data, obtaining the percentage corresponding to each size before being processed by GRADISTAT.

Data records
The full archived dataset can be obtained from SOCIB Data Catalog 36 . topobathymetries datasets. Sand levels are provided in four different formats: (i) raw 3D point-clouds in ASCII files (projected to ETRS89); (ii) the DEMs, provided as ASC Arc/Info ASCII grids (projected to ETRS89); (iii) NetCDF files (projected to WGS84); and (iv) beach profiles (projected to ETRS89), in comma-separated files (csv). Table 1 summarizes the metadata of each type of file available in the repository. Winter campaigns from 2011 to 2018 only provided profile csv files and raw 3D point-cloud files.

Shorelines. Bi-monthly shoreline positions are provided in ASCII files that contain the x and y coordinates
Waves and water depth. Hourly wave and water depth measurements are available in ASCII files with filename format 'yyyy-mm-dd_mobims-calamillor_scb-awac00X.wap' (yyyy-mm-dd according to the registration date, with 00X being the ID number of the instrument deployed). Each ASCII ' * .wap' file is accompanied with the corresponding data header file (' * .hdr' extension) that contains all the information related to the instrument deployment, measuring parameters and details of the data contained in the '*.wap' files ( Table 2). The wave and water depth dataset spans from 2011/05/20 up to the present day, except for a gap from 2019/05/20 to 2019/10/29 due to an instrument failure. real-time weather data. Real-time weather data are provided as monthly datasets in netcdf format in the CD1.6 convention. Files account for the following variables: air temperature, air pressure, wind speed, average wind from direction, wind gust speed, direction of wind gust, acyclic wind direction (derived), relative humidity, rain accumulation, rain duration, rain intensity, rain peak intensity and voltage. Each variable measurement is accompanied by its corresponding quality control flag (see section Technical Validation).

Sediment grain size distribution. Sediment grain size is provided in a single spreadsheet ('BMF_GRAD_
Grainsize.xlsx') containing the main statistical parameters from the laser grain size analysis ( Table 3). The first sheet contains the ID code of the samples and their WGS84 Lat/Lon coordinates (Fig. 2d). The rest of the spreadsheets contain the results of the grain-size analysis of all the samples for each single field campaign. Due to the spatio-temporal variability of CLM morphodynamics, some sample records are discontinuous over time, due to water level, wave conditions or rock bed exposure.

technical Validation
Profiles and bathymetries. Errors in sand elevation are variable and depend on sea temperature, bed smoothness, GNSS platform and wave conditions. Several best-practices and quality control procedures are developed before, during and after surveying as well as during processing to ensure the reliability and quality of the resulting dataset. www.nature.com/scientificdata www.nature.com/scientificdata/ Echo-sounding and RTK-GPS surveying. Several best-practices and quality control procedures are involved in developing the bathymetries. Regarding the echo-sounding systems: (i) echo-sounding systems are regularly maintained and annually revised by Hypack©technical support; (ii) the water temperature is accounted for by adjusting the echo-sounding frequency and the sailing speed is maintained below 4 knots; (iii) the on-boat GPS system receives real-time differential corrections from the nearest geodesic nodes, and the echo-sounding system is calibrated automatically in relation to the movement of the boat; and (iv) proof sailing patterns are performed: roll (overlapping paths over flat areas in opposite directions), pitch (overlapping path over slopes in opposite directions), heading (partially overlapping paths in the same direction) and point controls (overlapping crossing paths).
In addition, during field campaigns, and if wave conditions permit, control matching points are measured by both the echo-sounding and the RTK-GPS backpack-mounted systems in order to cross-check the reliability of both methods. Gaps in spatial coverage may occur due to shallow sandbars, wave conditions, and mechanical failures.

Column
Variable Range/Units   www.nature.com/scientificdata www.nature.com/scientificdata/ Similarly, the RTK-GPS software is updated annually and technical maintenance is performed by Leica©technical support. The quality of RTK-GPS measurements is controlled by: (i) a minimum number of accessible satellites (14)(15)(16); (ii) an instantaneous and automatic correction from satellite and the closest geodesic nodes (representing a maximum error on x and y coordinates of 0.08 m) and a maximum elevation deviation threshold for 5-time point measurements set to 0.05 m. Mean errors in x and y coordinates are estimated on 0.03 m, and the maximum z coordinate error on 0.05 m.
Bathymetries and profiles postprocessing. Echo-sounder measurements are post-processed using Hypack©software. A first automatic filter is applied to prevent spike outliers. Corrections on head, pitch and roll, and heave are automatically applied if required. A human-eye review of the echo-sounding measurements is also performed to remove noise sounding. Once echo-sounding measurements have been processed, RTK-GPS topo-bathymetric measurements are added to the points cloud in order to perform a second review of data and to check elevation matching of the common points between RTK-GPS and the echo-sounder. The full data set is then extracted considering cell points of 1 × 1 m in the post-processed 3D point cloud files (see Table 1).

Shorelines.
Processing cameras images to obtain georeferenced planviews involves intrinsic and extrinsic calibrations. Intrinsic calibration involves the optical correction of images due to lens distortion. After the optical correction, the extrinsic calibration generates georeferenced planviews from images by relating image pixels to real world coordinates. Typically, the resolution ranges between 0.5 and 2 pixels for Cala Millor. Conversely, pixel resolution decreases with distance, but a higher resolution (~0.2 m) is obtained at the shore since cameras are oriented to measure shoreline at the centre of the image 32 . Shoreline positions are digitized from the plan-view images. To prevent differential effects of thermal dilatation of the cameras, shorelines are extracted from images taken at 12:00 AM (CEST).
Repeated RTK-GPS shoreline measurements are taken over the year to validate the accuracy of the shoreline extraction method. The calibration is also repeated if camera movements are detected, ensuring a minimum error in the image georeferencing.
Wave and water depth validation. The AWAC-AST is replaced during each field campaign, and the corresponding maintenance and calibration are performed to ensure accurate data acquisition for the next deployment. The AWAC-AST data records also account for the corresponding instrument error code. The error code enables users to determine the data reliability. Further details on the definition of AWAC-AST error codes can be found at https://www.nortekgroup.com/.
real-time weather measurements validation. Meteorological stations are controlled daily. Routine maintenance and calibration of instruments ensure the correct functioning and measuring of the instruments. Instrument accuracy information can be found at www.vaisala.com. Quality control procedures for weather measurements are also performed automatically based on the fulfilment of valid ranges (regional and local ranges) of variable values, gradients and spikes and stationary data. The accomplishment of these controls provides each data with a flag determining its reliability (0 -no quality control performed; 1 -good data; 2 -probably good data; 3 -probably bad data; 4 -bad data; 6 -spike; 9 -missing value). Sediment grain size. Laser particle size measurements using the Malvern instrument follow the international standards for particle size measurements ISO13320-1 37 . The laser grain-size analyser system is regularly cleaned and maintained. A professional revision, maintenance and calibration of the instrument is also performed annually by the Malvern technical support. During the sample analysis, the system requires verification steps related to optical cleaning and system functioning to ensure the quality of the laser measurements. Each sample analysis is accompanied by a residual value, representing the difference between the measurements and analysis fit. If the residual exceeds a threshold (R > 4), the sample analysis is repeated.

Usage Notes
For ease of use, datasets 36 are presented in different formats as input for reading software and numerical models.

Code availability
The gridded bathymetries (ASCII raster and netCDF files) are interpolated from raw sand elevation data by means of the code file 'BMF_Gridded_bathymetry.m' (included in the data repository folder 36 ). Code is written in MATLAB (R2018b) and is fully commented. Although MATLAB is a proprietary language, the '.m' files can be read with a text viewer.